Synthetic peptides which induce cellular immunity to the AIDS virus and AIDS viral proteins.

This invention relates to the identification of short peptide
segments of AIDS virus proteins which elicit T cell immunity,
and to a method of inducing cellular immunity to native proteins
of the AIDS virus by immunization with short synthetic peptides.
Five potential peptides have been identified by searching for
regions which can fold as a maximally amphipathic helix. These
may be useful to include in either a synthetic peptide- or
recombinant fragment-based vaccine.

Description

SYNTHETIC PEPTIDES WHICH INDUCE CELLULAR IMMUNITY TO THE AIDS VIRUS AND AIDS VIRAL PROTEINS

The purpose of the present invention is to develop a vaccine to prevent acquired immunodeficiency
syndrome (AIDS) based partly or solely on synthetic peptides which produce T cell immunity. T cell immunity is
an important arm of defense against viral infections, but has hardly been studied for AIDS. Helper T cells are
needed for an antibody response as well as for a cytotoxic T cell response and for inducing macrophage and
LAK cell killing. While it is not yet clear whether cellular immunity or humoral immunity is the critical element in
protection against a virus, the peptides of the present invention are particularly suited for use in a vaccine
capable of safely eliciting either type of immunity: (1) the peptides are synthetically produced, and therefore do
not include live virus, or a part of a live virus, or other sites which might produce deleterious effects (e.g., the
site binding to T4 or the site producing cell fusion); (2) the peptides may be used alone to induce cellular
immunity; (3) the peptides may be used in conjunction with other molecules in order to induce antibody
production or a humoral response; and (4) the peptides may be targeted for a particular type of T-cell
response without the side effects of other unwanted responses.

Much effort has been devoted to the analysis of antibodies to AIDS virus antigens, but no previous studies
have defined antigenic sites of this virus which elicit T cell immunity, even though such immunity is important in
protection against many other viruses. Analysis of immunodominant helper T cell sites has suggested that
such sites tend to form amphipathic helices. Using an algorithm based on this model, two candidate T cell
sites, env T1 and env T2, were identified in the human T cell lymphotropic virus type IIIb (HTLV-IIIb) envelope
protein that were conserved in other Human Immunodeficiency Virus (HIV) isolates. Corresponding peptides
were synthesized and studied in genetically defined inbred and F1 mice for induction of lymph node
proliferation. After immunization with a 425 residue recombinant envelope protein fragment, significant
responses to native gp120 as well as to each peptide were observed in both F1 combinations studied.
Conversely, immunization with env T1 peptide induced T cell immunity to the native gp120 envelope protein.
The genetics of the response to env T1 peptide were further examined and revealed a significant response in
three of four independent Major Histocompatibility haplotypes tested, an indication of high frequency
responsiveness in the population.
Identification of helper T cell sites should facilitate development of a highly immunogenic, carrier-free
vaccine that induces both T cell and B cell immunity. The ability to elicit T cell immunity to the native AIDS viral
protein by immunization with a 16-residue peptide suggests that such sites represent potentially important
components of an effective AIDS vaccine.

Since the discovery of the human immunodeficiency viruses, the causative agents of the acquired
immunodeficiency syndrome (AIDS), substantial progress has been made toward characterizing the viral
genes and their products in infected cells. Though responsible for profound immunodeficiency, viral infection
in man consistently induces a detectable immune response as evidenced by serum antibodies to the major
viral proteins. Studies of serum reactivity to specific viral proteins have revealed no consistent prognostic
associations to date. Much of the host antibody response is focused on the envelope proteins gp120 and
gp41. The ability of native gp120 or a large recombinant fragment to induce neutralizing antibodies has been
demonstrated. Two apparently immunodominant antibody binding sites in the gp41 envelope protein have
been defined at the level of small synthetic peptides. Though such antibody sites are clearly of diagnostic
importance and potentially of importance for vaccine design, the typical progression of AIDS in patients
despite the presence of these antibodies suggests that effective T cell immunity is important to the immune
defense against this pathogen.

An ideal vaccine is highly immunogenic, induces both T cell and B cell virus-specific immunity, and is free of
irrelevant carrier proteins. While traditional approaches using whole virion or virion subunits can generally
achieve this, practical considerations such as safety and availability of native antigen have led many to
consider more highly engineered vaccine constructs for AIDS. Localization of immunodominant T cell and B
cell recognition sites becomes critical if one wishes to design a vaccine based on recombinant proteins or
synthetic peptides. A T cell response to the gp120 envelope protein has been demonstrated recently by
Zarling et al., Nature 323: 344-345 (1986), in macaques immunized with vaccinia constructs containing gp120
coding sequence. However, identification and characterization of immunodominant T cell sites within this 518
residue protein or other HIV proteins have not been reported.
Antibodies typically recognize free antigen in native conformation and can potentially recognize almost any
site exposed on the antigen surface. In contrast, typical CD4+ helper T cells recognize antigen only in the
context of the Class II Major Histocompatibility (MHC) molecule and only after appropriate antigen processing,
usually consisting of proteolysis or denaturation. Additionally, the polyclonal T cell response is focused only on
relatively few discrete sites. This limited response is seen even for non-eukaryotic proteins (e.g., influenza
hemagglutinin and staphylococcal nuclease) for which tolerance to homologous host proteins does not limit
the number of antigenic sites. Therefore, it is important to find sites which do elicit T-cell immunity to AIDS viral
proteins. The elucidation of features determining immunodominance, both intrinsic and extrinsic to the
antigen, is the focus of much current basic and clinical interest. Detailed characterization of immunodominant T
cell sites has allowed exploration for general features. Such analysis led to the observation that
immunodominant T cell sites tend to have an amino acid sequence consistent with formation of an amphipathic
helix with hydrophilic residues on one face and hydrophobic residues on the opposite face. In an amphipathic
alpha helix, the hydrophobicity varies sinusoidally with a period of 3.6 residues per turn of the helix or a
frequency of 100° per residue. An amphipathic 3₁₀ helix has a period of 3 and a frequency of 120°. Based on
this model, an algorithm entitled AMPHI has been developed for identification of such sequences in proteins
given only primary sequence data.
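
As a worked restatement of the two periodicities quoted above, the rotation per residue is simply 360° divided by the number of residues per turn. The second expression is one common way to quantify how strongly a hydrophobicity profile oscillates at a given angular frequency; it is shown here only as an illustrative assumption, since the text does not give the exact formula AMPHI uses. Here h_k denotes the hydrophobicity of the k-th residue in a block of n residues (a symbol introduced for this sketch).

```latex
\[
\theta_{\alpha} = \frac{360^\circ}{3.6\ \text{residues/turn}} = 100^\circ\ \text{per residue},
\qquad
\theta_{3_{10}} = \frac{360^\circ}{3\ \text{residues/turn}} = 120^\circ\ \text{per residue}
\]
\[
P(\theta) = \Bigl(\sum_{k=1}^{n} h_k \cos k\theta\Bigr)^{2}
          + \Bigl(\sum_{k=1}^{n} h_k \sin k\theta\Bigr)^{2}
\]
```

Under this assumed measure, a segment whose hydrophobicity alternates with the helical period gives a large P at 100° or 120° and a small P at unrelated frequencies.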

Although Zarling et al., Nature 323: 344 (1986), showed that T cell immunity to the AIDS virus envelope could
be induced in monkeys using a recombinant vaccinia virus carrying the gene for the whole envelope protein,
their study did not identify antigenic sites stimulating these T cells. Also, vaccinia immunization has been
discontinued in the United States because of danger of disseminated vaccinia infection and other side effects.
A synthetic peptide vaccine would not carry any of these risks. Since it would be synthetic, there would be no
risk of live AIDS virus contamination as might occur with a killed virus vaccine. A synthetic peptide which does
not contain sites responsible for syncytia formation might produce fewer side effects than a large recombinant
envelope protein containing such sites.

Summary of the Invention 

The present invention relates to a method of inducing cellular immunity to native proteins of the AIDS virus
by immunization with short synthetic peptides. Five potential peptides are identified by searching for regions
which can fold as a maximally amphipathic helix. Two of these are recognized by T cells immune to the AIDS
virus envelope protein. One of these, a 16-residue peptide, was used to immunize mice and successfully
induced T cell immunity to the whole AIDS envelope protein (480 amino acid residues long) in 3 of 4
mouse strains tested. The invention includes but is not limited to these specific peptides and any synthetic
peptides containing or overlapping these sequences or variations therein, including peptides with amino acid
substitutions that retain or enhance the activity documented. These can be used alone as a vaccine or in
combination with other materials (e.g. primary immunization with peptide, boost with recombinant fragment).
They can also be attached to sites binding neutralizing antibodies to induce a neutralizing antibody response.

The present invention is critical to the manufacture of peptide vaccines capable of eliciting T-cell immunity.
One aspect of the present invention is the discovery of certain traits which seem to be common to most T-cell
stimulating protein segments. Such peptide vaccines should optimally be those protein segments which (a)
have a propensity to form amphipathic alpha-helices; (b) do not have regions with a propensity to coil
formations; and (c) have a lysine at their COOH-terminus. The last two observations are of particular use in
manufacturing peptide vaccines; they indicate where the synthetic peptides should be terminated.

Description of the Figures 

Figure 1 shows the HTLV-III gp160 envelope protein. The gp120 and gp41 proteins are shown along with
recombinant proteins and synthetic peptides referred to in this study. Selected restriction site locations
are shown on the map at the top along with the shaded leader peptide (1-37) and transmembrane region
(519-534). The precursor gp160 is cleaved to form the gp120 and gp41 proteins. The dimensions and
locations of the R10 and PB1 recombinant proteins and the env T1 and env T2 synthetic peptides examined in
this study are shown. The locations of known B cell epitopes (B1 and B2) in the gp41 protein are also
shown.

Figure 2 shows the results of AMPHI analysis in the region of the env T1 and env T2 sites.

Figure 3 shows lymph node proliferation assays of HTLV-III envelope gp120 and related recombinant
and synthetic peptide antigens. F1 hybrid mice were immunized with either 10 micrograms of the large
recombinant protein R10 (panels A and B labeled NP) or 3 nanomoles of peptide env T1 (panels C and D
labeled PN) in 50 microliters of complete Freund's adjuvant (DIFCO, Detroit, MI) at the base of the tail.
Eight days later the draining inguinal and periaortic lymph nodes were removed and a single cell
suspension prepared. Assays were set up in quadruplicate with appropriate antigen in 96-well plates
with 3 x 10^5 cells per well in complete medium consisting of RPMI 1640 with 44% Eagle's-Hanks' amino
acid medium, 10% fetal bovine serum, 5 x 10^-5 M 2-mercaptoethanol, 2 mM fresh-frozen L-glutamine
(Gibco, Grand Island, NY), 100 U/ml penicillin, and 100 micrograms/ml streptomycin (Gibco). Plates were
incubated for 4 days at 37° in a 5% CO2 incubator, pulsed with 1 microCi of [3H]-thymidine (New England
Nuclear, Boston MA) and harvested 18 hours later onto glass fiber paper. Thymidine incorporation into
DNA was then quantitated by liquid scintillation counting.

The geometric mean and standard error of the mean for each group were determined and the no antigen
background subtracted to obtain the delta cpm. The no antigen backgrounds were: A: 21,771; B: 17,844;
C: 30,674; D: 29,298. The confidence intervals for the background values are shown at the zero position of
each vertical axis (n = 8). The panels are scaled according to the magnitude of the PPD positive control
response.
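
The statistics named in this legend can be sketched as follows. This is a minimal illustration, not the authors' procedure: the function names are hypothetical, the replicate counts are invented for the example, and the multiplicative form of the standard error (the exponential of the standard error of the log counts) is an assumption chosen because the tables report standard error terms such as (1.07) next to geometric means.

```python
import math

def geometric_mean(counts):
    """Geometric mean of replicate cpm values."""
    logs = [math.log(c) for c in counts]
    return math.exp(sum(logs) / len(logs))

def geometric_se_factor(counts):
    """Assumed multiplicative standard error: exp(SD of ln(cpm) / sqrt(n))."""
    n = len(counts)
    logs = [math.log(c) for c in counts]
    mean = sum(logs) / n
    variance = sum((x - mean) ** 2 for x in logs) / (n - 1)
    return math.exp(math.sqrt(variance / n))

def delta_cpm(antigen_counts, background_counts):
    """Geometric mean response minus the geometric mean no-antigen background."""
    return geometric_mean(antigen_counts) - geometric_mean(background_counts)

# Hypothetical replicate wells (not data from the source):
# quadruplicate antigen wells and n = 8 no-antigen wells.
antigen_wells = [52000, 48000, 50500, 49500]
background_wells = [21000, 22500, 21500, 22000, 21800, 21200, 22300, 21700]

print("delta cpm:", round(delta_cpm(antigen_wells, background_wells)))
print("SE factor:", round(geometric_se_factor(antigen_wells), 3))
```

Working on the log scale keeps a single high-count well from dominating the group summary, which is the usual reason for reporting geometric rather than arithmetic means for thymidine incorporation.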

Figure 4 shows the response of env T1 peptide immune lymph node cells to gp120 and related antigens
in the independent parental mouse strains. C57BL/6, C3H/HeJ, A.SW, and BALB/c mice were immunized
with the 16-residue env T1 peptide and lymph node proliferation assays were performed as described in
the legend to Figure 3. SW102 is a peptide representing sperm whale myoglobin residues 102-118. The no
antigen (0) and PPD negative and positive controls are shown in the first position of each panel. The
panels are scaled according to the magnitude of the PPD response. The no antigen backgrounds were:
B6: 16,334; C3H: 74^53; A.SW: 28,771; BALB/c: 34,600.

Figure 5 shows alpha helical net representation of the env T2 and env T1 sites. This display can be
thought of as slicing the cylinder of the helix lengthwise down one face, opening and flattening it. There
are 3.6 residues per turn of the helix. The hydrophobic residues are shaded. Residues common to both
sites are boxed. Regions outside of the peptides are shaded according to hydrophobicities of residues in
the gp120 sequence.

Detailed Description of Preferred Embodiments of the Invention
Prediction of T-cell antigenic peptides has important implications for the development of artificial vaccines.
Such vaccines are particularly useful in diseases like leprosy, caused by organisms which are hard to culture
and for which the cellular arm of the immune system is the principal defense. Even when antibody production is
the primary goal of vaccination, a secondary or anamnestic response requires the induction of helper T-cell
immunity. Prediction of peptides for use as vaccines requires discovery and confirmation of properties
correlating with T-cell antigenicity. One of the purposes of this invention is to use such properties in a process
capable of reliably predicting T-cell stimulation by a protein segment.

T-cell immunity is not only important in the defense against many viral infections; T-cell help is also
necessary for a memory antibody response. However, only a limited number of segments of protein antigens
elicit T-cell immunity. None have previously been found for any proteins of the AIDS virus. The T-cell antigenic
sites from AIDS viral proteins covered by the present invention should be very useful in any vaccine developed
for this virus. The present invention also shows that one of the peptides elicits T-cell immunity to the native
AIDS gp120 envelope protein. Thus, these peptides may constitute all or part of a synthetic vaccine. Also, a
fragment vaccine made by recombinant DNA or other technology should be designed to include these sites.

The experimental peptides containing the immunodominant sites are defined herein as antigenic sites.
"Antigenicity" in this invention always refers to T-cell antigenicity.

In vivo, an antigenic protein probably passes through three main steps before raising a helper T-cell
response: (a) "processing": an antigen-presenting cell (APC), usually a macrophage, dendritic or B cell,
ingests the protein and then digests it into smaller peptides; (b) "presentation": these peptides are then
presented to T-cells, probably in conjunction with a Class II Major Histocompatibility Complex protein on the
APC surface; and (c) "recognition": a helper T-cell receptor then recognizes some combination of peptide and
Class II protein, and initiates a T-cell response.

Two antigenic properties are thought to contribute to this process, amphipathicity and alpha-helicity, based
on the findings in this invention.

A structure is amphipathic when it has both a hydrophobic portion and a hydrophilic portion. A peptide is
segmentally amphipathic when the peptide contains at least two disjoint subpeptides, one hydrophobic, the
other hydrophilic. A peptide is alpha-amphipathic if, when the peptide is put into an alpha-helical conformation,
one side of the alpha-helix is hydrophobic, the other side hydrophilic. A peptide is helically amphipathic if,
when put into an alpha or 3₁₀ helix, or similar helical structure, one side of the helix is hydrophobic and the
other side hydrophilic. Both segmental amphipathicity and helical amphipathicity are believed to contribute to
T-cell antigenicity, though opinions about their relative importance differ.

AMPHI was used to analyze the gp120 envelope protein of the HTLV-IIIb isolate of HIV for sequences
consistent with formation of amphipathic helices as potential T cell sites. Sites were ranked according to the
apparent strength of helical amphipathicity as reflected in the Amphipathic Score, and frequencies were
examined for consistency. Sites were further selected for occurrence in constant regions of gp120 (based on
a comparison of the sequence of six isolates) and for absence of N-linked glycosylation sites. AMPHI
parameters for the two most favorable sites are shown in Figure 2. Candidate T cell sites were selected by
including appropriate flanking residues. Candidate T cell sites env T1 and env T2 were defined as residues 428
through 443 and 112 through 124, respectively. The standard epitope nomenclature employed consists of viral
isolate, protein designation, site type, and assigned number or residue number (e.g. HTLV-III(BH10) env T1).
Synthetic peptides corresponding to these sites were prepared by standard methods of solid phase peptide
synthesis on a Vega 250 peptide synthesizer
using double dicyclohexylcarbodiimide mediated couplings. HOBT preactivation couplings were performed
when coupling Gln or Asn. The standard t-boc/benzyl amino acid protection strategy was employed with side
chain protection of the following amino acids: Asp(O-benzyl), Glu(O-benzyl), His(tosyl),
Lys(2-chlorobenzyloxycarbonyl), Ser(benzyl), Trp(formyl), Tyr(2,6-dichlorobenzyl). The extent of coupling was
monitored using the qualitative ninhydrin test and recoupling performed when less than 99.4% coupling was observed.
Peptides were cleaved from the resin using the low/high hydrogen fluoride (HF) method (J.P. Tam, W.F. Heath,
and R.B. Merrifield, J. Am. Chem. Soc. 105, 6442 (1983)). For peptide env T2 standard HF cleavage was
employed as removal of the tryptophan formyl protecting group was found not to be required for antigenic
activity. Peptides were purified to homogeneity by gel filtration on Biogel P4 in 9% formic acid followed by
reverse phase HPLC as described previously. Composition was confirmed and concentration determined by
amino acid analysis. Native gp120 was purified by selective detergent extraction and immunoaffinity
chromatography followed by dialysis. The recombinant proteins R10 and PB1 were produced by cloning
restriction fragments KpnI (5923) to BglII (7197) or PvuII (6659) to BglII (7197) from the BH10 clone of HTLV-IIIb
into the Repligen Expression Vector (REV) followed by expression in E. coli and purification. Protein R10
contains residues 49 through 474 and PB1 residues 294 through 474 of the envelope protein.

As a genetically defined model of an outbred population, the immune response to these
proteins was studied in (C57BL/6 x C3H/HeJ)F1 and (A.SW x BALB/c)F1 mice (H-2 b x k and H-2 s x d, respectively). This
strategy provides for full H-2 expression and complementation in the context of four different
backgrounds. T cells have been shown to be responsible for the proliferation observed in antigen specific
lymph node proliferation assays and consequently such assays were employed as a measure of T cell
immunity.

Quantities of purified gp120 available precluded use in immunization and thus the R10 protein containing the
majority of the gp120 sequence in nonglycosylated form was the large immunogen used. Figure 3, panels A
and B, shows the lymph node proliferative response of R10 immune mice tested with native gp120,
recombinant proteins, and synthetic peptide env T1. In both F1 hybrids a strong response was observed not
only to the immunogen R10, but also to gp120. Therefore, the response was largely directed at envelope
sequence and not at the irrelevant residues encoded by vector-derived flanking sequence in the recombinant
protein. Thus the recombinant R10 fragment is an effective immunogen for priming for a response to the native
gp120. The response to the synthetic peptide env T1 indicates that a significant component of the T cell
response to the 425 residue R10 is in fact focused on the 16 residue env T1 site. In other experiments with R10
immune lymph node cells a response to peptide env T2 similar to that to peptide env T1 was observed (Table I).

TABLE I

RESPONSE TO ENV T2 PEPTIDE IN R10 (49-474)-IMMUNE HYBRID MICE
RELATIVE TO NATIVE gp120 AND env T1 PEPTIDE

                       Hybrid Mice
Antigen     (B6 x C3H)F1        (A.SW x BALB/c)F1
gp120       69,738 (1.01)       65,949 (1.07)
env T1      17,686 (1.12)       25,140 (1.15)
env T2      20,703 (1.18)       23,332 (1.14)
medium      10,864 (1.09)       13,381 (1.07)

[3H]-thymidine incorporation is shown for each group expressed as the geometric mean counts per minute
with the standard error term for quadruplicate samples shown in parentheses. N = 8 for the medium controls.
Antigen concentrations were 0.076 micromolar for gp120 and 4.8 micromolar for the env T1 and T2 peptides.
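
To read Table I on the same background-subtracted scale used in the figure legends (delta cpm, the response minus the no-antigen medium control), the subtraction can be sketched as below. The cpm values are transcribed from Table I; the dictionary layout and the function name are illustrative choices, not part of the source.

```python
# Geometric mean cpm values transcribed from Table I.
TABLE_I = {
    "(B6 x C3H)F1": {"gp120": 69738, "env T1": 17686, "env T2": 20703, "medium": 10864},
    "(A.SW x BALB/c)F1": {"gp120": 65949, "env T1": 25140, "env T2": 23332, "medium": 13381},
}

def background_subtracted(table):
    """Subtract each hybrid's medium control from its antigen responses."""
    result = {}
    for hybrid, row in table.items():
        background = row["medium"]
        result[hybrid] = {
            antigen: cpm - background
            for antigen, cpm in row.items()
            if antigen != "medium"
        }
    return result

deltas = background_subtracted(TABLE_I)
for hybrid, row in deltas.items():
    print(hybrid, row)
```

On this scale the env T2 response is of the same order as the env T1 response in both hybrids, which is the comparison the table is cited for.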

Given that immunization with a large fragment spanning most of the gp120 sequence elicits a response
partially focused on a small site defined by a synthetic peptide (a native immunogen/peptide test-antigen or NP
experiment), we next asked whether immunization with the synthetic peptide would elicit immunity to the
native protein (a peptide immunogen/native test-antigen or PN experiment). Immunogenicity in the PN
direction would appear to be a prerequisite for efficacy as a vaccine site. The results of such an experiment in
the F1 mice are shown in Figure 3, panels C and D. Mice immunized with env T1 peptide showed substantial
immunity not only to the env T1 immunogen but also to the native gp120 as well as to the recombinant proteins.
Thus a 16-residue synthetic peptide selected on the basis of amphipathicity can elicit T cell immunity to the
native AIDS virus protein.

To further characterize genetic restriction of the response to env T1, the independent H-2
disparate parental strains from which the F1 hybrids were derived were studied: C57BL/6, C3H/HeJ, A.SW and
BALB/c. Mice were immunized with env T1 peptide and studied with native and peptide antigens. As shown in
Figure 4, C57BL/6 (H-2b haplotype) was found to be a low responder, whereas the other strains (H-2k, H-2s, and
H-2d haplotypes) were intermediate or high responders to the env T1 peptide. The response to native gp120
paralleled that to the peptide. A corresponding pattern of responsiveness is also observed in experiments
using H-2 congenic strains of mice (Table II). Thus peptide env T1 represents a 16 residue peptide that can
prime T cells for a secondary response to the 518 residue glycosylated native gp120 in multiple MHC
haplotypes.

TABLE II

RESPONSE TO gp120 IN ENV T1 PEPTIDE-IMMUNE H-2 CONGENIC MICE

                               Congenic Strains
Antigen     B10.A(5R)         B10.BR             B10.S(9R)         B10.D2
PPD         85,511 (1.06)     100,872 (1.07)     71,006 (1.05)     44,564 (1.17)
gp120       45,857 (1.02)     69,456 (1.05)      68,219 (1.06)     64,858 (1.10)
medium      29,715 (1.03)     40,639 (1.04)      22,863 (1.04)     19,665 (1.06)

[3H]-thymidine incorporation is shown for each group expressed as the geometric mean counts per minute
with the standard error term for quadruplicate samples shown in parentheses. N = 8 for the medium controls.
Antigen concentrations were 0.075 micromolar for gp120 and 32 micrograms per milliliter for PPD.

An unexpected finding was the striking cross reaction between env T1 and env T2 peptides. The env T1
immune cells responded to env T2 as well as to the immunizing peptide. Cross reactivity of env T2 was most
pronounced on the H-2k haplotype. Prompted by this finding, the two sequences were compared and there
was a degree of homology which was even more evident when considered in the context of possible alpha
helical structure as shown in Figure 5. Not only do env T1 and env T2 share the hydrophobic Ile-Ile-Xxx-Yyy-Trp
cluster on the hydrophobic face and the Lys on the hydrophilic face of the helix, but also the spatial
relationship between these is identical. Gln and acidic amino acids (Glu, Asp) neighboring the Lys are
observed in both cases as well. The poor reactivity to peptide 102-118 of sperm whale myoglobin, which is
derived from an unrelated protein and shares minimal homology with env T1, indicates that the property of
being an amphipathic alpha-helical peptide is not sufficient for cross-reactivity. As an additional specificity
control, gp120, env T1 and env T2 were tested using lymph node cells immune to an unrelated antigen, sperm
whale myoglobin, and were found to be non-stimulatory (data not shown).

Though species differences are certain to influence the T cell repertoire, the molecules and mechanisms
leading to a T cell response are conserved across species and thus the factors determining
immunodominance are similar as well. The one helper T cell site (from influenza virus) that has been
characterized at the synthetic peptide level in man is in fact immunodominant in mice as well and has an amino
acid sequence consistent with formation of a highly amphipathic alpha helix.

Analysis of the various viral proteins in the thirteen HIV isolate sequences published shows the envelope
gene to be the most variable. Such analysis reveals discrete variable and conserved regions. A polyvalent
vaccine might thus be appropriate. Both the env T1 and the env T2 sites are highly conserved in the HIV
sequences reported to date. The env T2 site is the more highly conserved of the two sites, especially in the
LAV-ELI isolate. The previously described B cell sites (B1 and B2 in Figure 1) fall within conserved regions as
well.

Induction of T cell immunity may contribute in several ways to protection against HIV infection. Though AIDS
progresses despite the presence of detectable antibody to viral proteins in most patients, low titer neutralizing
antibody has been demonstrated in many such patients. Neutralizing titers are substantially higher in healthy
AIDS related complex patients and in HIV antibody positive hemophiliacs. Whether this relationship is causal or
simply correlative is as yet unknown. If these or any other antibodies are in fact protective, provision of
optimum T cell help at the time of immunization as well as when faced with an infectious challenge would
appear essential. Substantial T cell help should also be required for an effective cell mediated response to
infected cells. NK cells have been shown to selectively kill HIV infected cells in vitro. Given that a major mode of
viral transmission in the infected patient is thought to be cell to cell, priming helper T cells for
augmentation of NK cell and possibly lymphokine activated killer cell activity is essential for an effective
vaccine. Induction of virus-specific cytotoxic T lymphocyte (CTL) immunity, which also requires helper T cells,
is also desirable. While in some cases the determinants recognized by CTL can in fact be defined by small
peptides, in vivo expression of antigens such as in a vaccinia or adenoviral vector may be required for efficient
elicitation of a classical CTL response. The AMPHI algorithm was developed to identify helper T cell sites, and
consequently its relevance to CTL specificity is unknown. However, it does successfully identify the two
characterized sites in influenza nucleoprotein recognized by human and murine CTL.

The fact that helper T cell immunity can be induced with short peptides as well or better than with native
protein stands in sharp contrast to the situation with B cell immunity, for which tertiary structure is frequently
important, and indicates that peptide vaccines aimed at T cell immunity are more successful than those aimed
at antibody production. In a synthetic peptide or recombinant fragment based construct, one could selectively
include important helper T cell sites, in multiple copies if desired, and exclude suppressor T cell sites which
can blunt the immune response. Sites associated with specific functions or possible undesirable side effects
such as the CD4 binding site(s), the site(s) mediating syncytia formation, or the neuroleukin homology site,
can be systematically included or excluded. For vaccines designed to induce antibodies as well as T cell
immunity, incorporation of pathogen-derived T cell sites along with important B cell sites obviates the need to
chemically couple small peptides to irrelevant carriers and disposes of coupling and carrier-derived problems.
Thus the T cell sites identified here are potentially important components of an effective AIDS vaccine.

The present invention includes a method of predicting which segments of a protein (along its entire
sequence, if desired) are antigenic. In other words, the present invention is a method of determining which
sites of an entire protein sequence are recognized by T-cells (activate or stimulate T-cells). Application of this
method is limited only by knowledge of the amino acid sequence of a protein, i.e., it can be applied to any
protein in the protein data base of the National Biomedical Research Foundation or any protein whose sequence
is subsequently published. Moreover, the analysis can be done using the amino acid sequence translated from
a DNA gene sequence, without ever isolating the protein. The background experiments which made this
process possible comprise, in their entirety, an examination of a number of properties to determine if a
particular property or properties is implicated in T-cell stimulation.

The following properties were determined to be fundamentally important (with a high degree of significance)
in determining the potential immunogenicity of certain protein sequences:

a. the helical amphipathicity of segments along the entire sequence of a protein;

b. the conformational propensity of segments along the entire sequence of the protein;

c. the presence or absence of helix-breakers in segments along the entire sequence of the protein; and

d. the presence and location in the protein sequence of amino acid residues which favor T-cell
recognition.

These properties were used to develop an optimized algorithm for detecting T-cell antigenic sites (based on
the amphipathic helix model) in a protein with known sequence. The optimum algorithm identifies 18 of 23
known sites (75% sensitivity), with a high degree of significance (p<0.001). The success of the algorithm also
shows that stable amphipathic structures such as amphipathic helices are fundamentally important in
determining immunodominance. The optimized algorithm enables the prediction of immunodominant T-cell
sites on a protein. This prediction capability facilitates the rational design of synthetic vaccines, and facilitates
other approaches to antigen specific T-cell recognition.

EXAMPLES



Example 1 

The AMPHI algorithm was used to examine the HTLV-III envelope protein amino acid sequence for sites with
periodic variation in the hydrophobicity consistent with formation of an amphipathic helix (alpha or 3₁₀).
Overlapping blocks of 11 residues were examined and the resultant parameters assigned to the middle residue.
The results for the residue 100-200 and 400-... regions, encompassing the env T2 and T1 sites respectively,
appear on the left and right of Figure 2. The upper panels show the amphipathic index, a measure of intensity of
amphipathicity, determined at a frequency of 100° or 120° per residue. The higher of the two is shown. The
lower panels display the frequency where the maximum amphipathic signal is observed. The sites were
selected based on their consistently high amphipathic indices with maxima in the helical frequency range. The
amino acid sequences of the indicated sites are: env T2 (112-124): His-Glu-Asp-Ile-Ile-Ser-Leu-Trp-Asp-Gln-
Ser-Leu-Lys; env T1 (428-443): Lys-Gln-Ile-Ile-Asn-Met-Trp-Gln-Glu-Val-Gly-Lys-Ala-Met-Tyr-Ala.
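
The block-wise scan described in this example can be illustrated with a short Python sketch. It is not the AMPHI program itself: the amphipathic index is approximated here by the Fourier power of the mean-subtracted hydrophobicity values at 100° and 120° per residue, and the Kyte-Doolittle hydropathy scale is used as an assumed stand-in, since the scale used by AMPHI is not stated in this text. The function names are hypothetical.

```python
import cmath
import math

# Kyte-Doolittle hydropathy values (assumed stand-in scale, not taken from the source).
KD = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def periodic_power(values, degrees):
    """Squared magnitude of the Fourier component at `degrees` per residue."""
    mean = sum(values) / len(values)
    theta = math.radians(degrees)
    total = sum((v - mean) * cmath.exp(1j * k * theta)
                for k, v in enumerate(values))
    return abs(total) ** 2


def amphipathic_profile(seq, block=11, freqs=(100.0, 120.0)):
    """Score each overlapping block and assign the score to its middle residue.

    Returns a list of (middle_index, score) pairs, where middle_index is the
    0-based position in `seq` and score is the higher of the powers at `freqs`.
    """
    half = block // 2
    profile = []
    for start in range(len(seq) - block + 1):
        h = [KD[aa] for aa in seq[start:start + block]]
        score = max(periodic_power(h, f) for f in freqs)
        profile.append((start + half, score))
    return profile


# env T1 sequence from this example, in one-letter code.
for index, score in amphipathic_profile("KQIINMWQEVGKAMYA"):
    print(index, round(score, 1))
```

A sequence with no variation in hydrophobicity scores zero under this measure, while a segment whose hydrophobic residues recur every 3 to 4 positions scores high, which is the behaviour the amphipathic helix model relies on.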

Example 2

Applying the algorithm is a major step in predicting the most probable immunodominant sites that show
amphipathic helical potency. The number and length of sites along a specific protein depend on the
hydrophobicity profile of that protein. There are proteins that show a high degree of amphipathic helical
potency (and contain many predicted sites), while others are poor in amphipathic segments. After having
predicted all the possible amphipathic helical segments, the segments must be graded. The present invention
prefers the use of three factors for grading purposes: a) amphipathic score (particularly useful when
comparing segments of the same length); b) the rarity of proline in helices in general (except near the
NH2-terminus), and in most of the helical antigenic sites in particular; and c) the appearance of lysine at the
carboxyl end in a large number of helical antigenic sites. It has been found that lysine as the ultimate or
penultimate C-terminal residue occurs much more frequently in immunodominant sites. In short, a preferred
sequence contains amphipathic segments with proline, if present, only near the N-terminus, and lysine near the
C-terminus.
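
The three grading factors can be collected for a candidate segment as in the following Python sketch. The text does not quantify "near the N-terminus", so the `n_term_window` parameter (default 4 residues) is purely an assumption for illustration, as are the function name and the dictionary layout. The amphipathic score is taken as an input computed elsewhere.

```python
def grade_segment(seq, amphipathic_score, n_term_window=4):
    """Summarize the three grading factors for a one-letter peptide sequence.

    n_term_window is a hypothetical cutoff: a proline is tolerated only if it
    lies within the first n_term_window residues.
    """
    proline_positions = [i for i, aa in enumerate(seq) if aa == "P"]
    return {
        # a) amphipathic score, supplied by the caller
        "amphipathic_score": amphipathic_score,
        # b) proline absent, or present only near the N-terminus
        "proline_ok": all(i < n_term_window for i in proline_positions),
        # c) lysine as the ultimate or penultimate C-terminal residue
        "c_terminal_lysine": "K" in seq[-2:],
    }


# env T2 from Example 1; the score of 1.0 is a placeholder value.
print(grade_segment("HEDIISLWDQSLK", 1.0))
```

Returning the factors separately, rather than folding them into one number, leaves the weighting to the user, since the text does not specify how the factors are combined.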

Another possible indicator is the presence of N-glycosylation sites; these sites are indicative of a less
favorable candidate for an immunodominant site, because the T-cell epitope may be masked by the
carbohydrate.
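
A candidate segment can be screened for potential N-glycosylation sites with a short Python sketch. The text does not define the site, so the sketch assumes the commonly used consensus sequon Asn-X-Ser/Thr with X being any residue other than proline; the function name is hypothetical.

```python
import re

# Assumed consensus sequon: Asn, then any residue except Pro, then Ser or Thr.
# The lookahead lets overlapping sequons be reported.
SEQUON = re.compile(r"N(?=[^P][ST])")


def n_glycosylation_sites(seq):
    """Return 0-based positions of Asn residues that start a consensus sequon."""
    return [m.start() for m in SEQUON.finditer(seq)]


# env segment 231-250, one-letter code as listed in Claim 9.
print(n_glycosylation_sites("KTFNGTGPCTNVSTVQCTHG"))
```

A segment for which the returned list is non-empty would, by the reasoning above, be treated as a less favorable candidate.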

Example 3 

Using the process of this invention, the following segments were predicted to be T-cell stimulation sites:

1) segment 93-127

PheAsnMetTrpLysAsnAspMetValGluGlnMetHisGluAspIleIleSerLeu-
TrpAspGlnSerLeuLysProCysValLysLeuThrProLeuCysVal

2) segment 415-437

ThrLeuProCysArgIleLysGlnIleIleAsnMetTrpGlnGluValGlyLysAla-
MetTyrAlaPro

The numbering sequence used in this invention corresponds to Ratner et al., Nature, 313:277-284 (1985).
The underlined portion of each segment has been shown by other methods to be a T-cell stimulation site.

The following segments from the envelope region of the HIV genome have been predicted as candidates for
T-cell stimulation sites:

segment 231-250    segment 615-652
segment 307-331    segment 659-681
segment 335-357    segment 777-808
segment 553-574    segment 827-856

The following segments from the gag region of the HIV genome have been predicted as candidates for T-cell
stimulation sites:

segment 5-22       segment 148-162
segment 52-70      segment 284-309
segment 93-110     segment 360-376

The numbering sequence used in this invention corresponds to Ratner et al., Nature, 313:277-284 (1985).
The amino acid sequences for these peptides are shown in Table 3.

TABLE 3

From the env protein:

Segment 93-127

PheAsnMetTrpLysAsnAspMetValGluGlnMetHisGluAspIleIleSerLeu-
TrpAspGlnSerLeuLysProCysValLysLeuThrProLeuCysVal

Segment 231-250 

LysThrPheAsnGlyThrGlyProCysThrAsnValSerThrValGlnCysThrHis-
Gly

Segment 307-331

IleArgIleGlnArgGlyProGlyArgAlaPheValThrIleGlyLysIleGlyAsn-
MetArgGlnAlaHisCys

Segment 335-357 

ArgAlaLysTrpAsnAsnThrLeuLysGlnIleAspSerLysLeuArgGluGlnPhe-
GlyAsnAsnLys

Segment 415-437 

ThrLeuProCysArgIleLysGlnIleIleAsnMetTrpGlnGluValGlyLysAla-
MetTyrAlaPro

Segment 553-574 

AsnAsnLeuLeuArgAlaIleGluAlaGlnGlnHisLeuLeuGlnLeuThrValTrp-
GlyIleLys

Segment 615-652

SerAsnLysSerLeuGluGlnIleTrpAsnAsnMetThrTrpMetGluTrpAspArg-
GluIleAsnAsnTyrThrSerLeuIleHisSerLeuIleGluGluSerGlnAsnGln

Segment 659-681

GluLeuLeuGluLeuAspLysTrpAlaSerLeuTrpAsnTrpPheAsnIleThrAsn-
TrpLeuTrpTyr 

Segment 777-808 

IleValThrArgIleValGluLeuLeuGlyArgArgGlyTrpGluAlaLeuLysTyr-
TrpTrpAsnLeuLeuGlnTyrTrpSerGlnGluLeuLys

Segment 827-856 

AspArgValIleGluValValGlnGlyAlaTyrArgAlaIleArgHisIleProArg-
ArgIleArgGlnGlyLeuGluArgIleLeuLeu

From the gag protein:

Segment 5-22

AlaSerValLeuSerGlyGlyGluLeuAspArgTrpGluLysIleArgLeuArg

Segment 52-70

GluThrSerGluGlyCysArgGlnIleLeuGlyGlnLeuGlnProSerLeuGlnThr

Segment 93-110

GluIleLysAspThrLysGluAlaLeuAspLysIleGluGluGluGlnAsnLys 
Segment 148-162 

SerProArgThrLeuAsnAlaTrpValLysValValGluGluLys 
Segment 284-309 

AspIleArgGlnGlyProLysGluProPheArgAspTyrValAspArgPheTyrLys 
ThrLeuArgAlaGluGlnAla 

Segment 360-376

AlaArgValLeuAlaGluAlaMetSerGlnValThrAsnThrAlaThrIle

Ala   A   Alanine            Leu   L   Leucine
Arg   R   Arginine           Lys   K   Lysine
Asn   N   Asparagine         Met   M   Methionine
Asp   D   Aspartic acid      Phe   F   Phenylalanine
Cys   C   Cysteine           Pro   P   Proline
Gln   Q   Glutamine          Ser   S   Serine
Glu   E   Glutamic acid      Thr   T   Threonine
Gly   G   Glycine            Trp   W   Tryptophan
His   H   Histidine          Tyr   Y   Tyrosine
Ile   I   Isoleucine         Val   V   Valine
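
Because Table 3 uses three-letter abbreviations and Claim 9 uses one-letter symbols, a small conversion helper makes the two easy to cross-check. The following Python sketch uses the standard abbreviations tabulated above; the function name and its tolerance for hyphens, spaces and line breaks are illustrative choices.

```python
THREE_TO_ONE = {
    "Ala": "A", "Arg": "R", "Asn": "N", "Asp": "D", "Cys": "C",
    "Gln": "Q", "Glu": "E", "Gly": "G", "His": "H", "Ile": "I",
    "Leu": "L", "Lys": "K", "Met": "M", "Phe": "F", "Pro": "P",
    "Ser": "S", "Thr": "T", "Trp": "W", "Tyr": "Y", "Val": "V",
}


def to_one_letter(three_letter_seq):
    """Convert a run of three-letter residue codes to one-letter symbols.

    Hyphens, spaces and line breaks between residues are ignored.
    """
    cleaned = "".join(ch for ch in three_letter_seq if ch.isalpha())
    if len(cleaned) % 3 != 0:
        raise ValueError("sequence length is not a multiple of three letters")
    return "".join(THREE_TO_ONE[cleaned[i:i + 3]]
                   for i in range(0, len(cleaned), 3))


# env T2 as written in Example 1.
print(to_one_letter("His-Glu-Asp-Ile-Ile-Ser-Leu-Trp-Asp-Gln-Ser-Leu-Lys"))
```

The length of the converted string can also be compared with the residue range of a segment (end minus start plus one) as a quick check that no residue was lost in transcription.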
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1. Synthetic peptides which may serve as components of a vaccine for acquired immunodeficiency 
syndrome (AIDS), characterized in that said peptides are selected by searching for peptide regions which
can fold as a maximally amphipathic helix and in that the peptides are recognized by T cells immune to the
acquired immunodeficiency syndrome (AIDS) virus envelope protein. 

2. Synthetic peptides comprising T cell sites env T1 and env T2, which are residues 428-443 and 112-124,
respectively, utilizing standard epitope nomenclature. 

3. The preparation of synthetic peptides corresponding to the sites of Claim 2. 

4. Synthetic peptides substantially corresponding to the sites of Claim 2, and any short peptides
substantially overlapping said sites. 

5. The prediction of T cell antigenic peptides recognized by T lymphocytes and able to stimulate cellular 
immunity for the sequences selected from the group consisting of env T1 (residues 428-443) and env T2 
(residues 112-124) with respect to the acquired immunodeficiency syndrome (AIDS) human T cell 
leukemia virus type III (HTLV-III).

6. The prediction of T cell antigenic peptides according to Claim 5 wherein the sequence is env T1
(residues 428-443). 

7. The prediction of T cell antigenic peptides according to Claim 5 wherein the sequence is env T2 
(residues 112-124). 

8. Peptides corresponding to sequences from HIV viral proteins and selected by the method of 
predicting T-cell antigenic sites by searching for segments which have periodicity of hydrophobicity 
properties wherein said properties exhibit the capability of said segment to form amphipathic helices, and 
wherein said peptides, when included in a vaccine, are capable of inducing T-cell immunity to acquired
immune deficiency syndrome. 

9. The peptides of Claim 8, wherein said sequences are amino acid sequences selected from the group 
consisting of: 

FNMWKNDMVEQMHEDIISLWDQSLKPCVKLTPLCV; KTFNGTGPCTNVSTVQCTHG;
IRIQRGPGRAFVTIGKIGNMRQAHC; RAKWNNTLKQIDSKLREQFGNNK;
TLPCRIKQIINMWQEVGKAMYAP; NNLLRAIEAQQHLLQLTVWGIK;
SNKSLEQIWNNMTWMEWDREINNYTSLIHSLIEESQNQ; ELLELDKWASLWNWFNITNWLWY;
IVTRIVELLGRRGWEALKYWWNLLQYWSQELK; DRVIEVVQGAYRAIRHIPRRIRQGLERILL;
ASVLSGGELDRWEKIRLR; ETSEGCRQILGQLQPSLQT; EIKDTKEALDKIEEEQNK;
SPRTLNAWVKVVEEK; DIRQGPKEPFRDYVDRFYKTLRAEQA; ARVLAEAMSQVTNTATI;
HEDIISLWDQSLK; and KQIINMWQEVGKAMYA.

10. The peptides of Claim 9, wherein said T-cell response is the induction of T-cell immunity.

11. The peptides of Claim 9, wherein said T-cell response is the induction of T-cell help for an antibody
response. 

12. A synthetic peptide capable of eliciting a T-cell response, said peptide substantially corresponding to
amino acid sequences 428-443 of the HIV envelope protein. 

13. A synthetic peptide capable of eliciting a T-cell response, said peptide substantially corresponding to 
amino acid sequences 112-124 of the HIV envelope protein.
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